
SEQUENCE LISTING 

<110> Mclninch, James 

<12 0> COMPUTATIONAL NUCLEIC ACID CODING AND FEATURE ANALYSIS 

<13 0> 04983 . 0220 . OOUSOO 

<140> 09/698,213 

<141> 2000-10-30 

<160> 13 

<170> Patentin versic 

<210> 1 

<211> 2165 

<212> DNA 

<213> , Arabidopsis thaliana 
<220> 

<221> unsure 

<222> (1) . . . (2165) 

<223> Unsure at all n locations 

<220> 

<223> Ecotype Landsberg, genomic DNA 

<400> 1 



tactcaaaaa 


tatattccat 


gcttaattag 


gccggattcg 


cggtgacgat 


gcaccaagag 


60 


cggtttttcc 


gagcattgta 


ggccgtcctc 


gccacaccgg 


tgtgatggtt 


gggatgggac 


120 


aaaaggatgc 


ttatgttgga 


gacgaggctc 


aatcaaaacg 


tggtatcttg 


actctgaagt 


180 


acccaattga 


gcatggaatt 


gttaataatt 


gggatgacat 


ggagaagatt 


tggcatcaca 


240 


ctttctacaa 


tgagcttcgt 


gttgcccctg 


aagaacatcc 


ggttctcttg 


accgaagctc 


300 


ctctcaatcc 


gaaagctaac 


cgtgagaaga 


tgactcagat 


catgtttgag 


acattcaata 


360 


ctcctgctat 


gtatgttgcc 


attcaagctg 


ttctctcact 


ctatgccagt 


ggccgtacta 


420 


ctggtcagta 


cattactaca 


ttctttttat 


accgtttggt 


tgaaataaaa 


ttcggtttgg 


480 


ttcgattcga 


gtttgctctc 


attattttta 


ttttgttggt 


taggtattgt 


tttggactcc 


540 


ggagatggtg 


tgagccacac 


ggtaccaatc 


tacgagggtt 


atgcacttcc 


acacgcaatc 


600 


ctgcgtcttg 


atcttgcagg 


tcgtgaccta 


accgaccacc 


ttatgaaaat 


cctgacagag 


660 


cgtggttact 


ctttcaccac 


aactgctgag 


cgtgagattg 


ttagagacat 


gaaggagaag 


720 


ctctcttaca 


ttgccttgga 


ctttgaacaa 


gagctcgaga 


cttccaaaac 


aagctcatcc 


780 


gttgagaaga 


gcttcgagct 


gccagacggt 


caagtgatca 


ccatcggggc 


agagcgtttc 


840 







9 










cgatgccctg 


aagttctgtt 


tcagccatcg 


atgatcggaa 


tggaaaatcc 


gggaattcat 


900 


gaaactactt 


acaactcaat 


catgaaatgt 


gatgtggata 


tcaggaagga 


tctttatgga 


960 


aacattgtgc 


ttagtggtgg 


caccacaatg 


ttcgatggga 


ttggtgatag gatgagtaaa 


1020 


gagatcacag 


cgttggctcc 


aagcagtatg 


aacatcaaag 


tggtggctcc 


accggaaagg 


1080 


aagtacagtg 


tctggatcgg 


tggctctatc 


ttggcttccc 


tcagtacttt 


ccagcaggta 


1140 


aattacttac 


tatacttaat 


acataaagtc 


tattagtgat 


ttgatgtata 


aagtgttaca 


1200 


aaaatgtgtt 


ccaaatttgc 


agatgtggat 


tgcgaaagcg 


gagtatgatg 


aatctggacc 


1260 


gtcaatcgtc 


cacaggaagt 


gcttctgatc 


aaaagtcacc 


aagtaaaaca 


agagcggtaa 


1320 


aaattttgat 


atcagttttt 


caccctgaag 


ccagttgcta 


taattactca 


caacttctct 


1380 


atttgtgttc 


ttttattctt 


gtccctcgtt 


gttcatttta 


atctcttttt 


tgcaacaaag 


1440 


caacttaaaa 


aaacagagca 


gtcattaaca 


gaatgttatt 


attatatata 


tgtatacata 


1500 


ttagtataca 


cccattattt 


cattaaaaca 


tttatcatat 


aaggatagga 


ttctatacat 


1560 


cgatatattt 


attttgttga 


cactattcag 


cacatgctta 


tgtcttatct 


tgttagtata 


1620 


tgtaaccaaa 


gacaaataat 


agatgctaca 


aattgttttc 


tttgaagcaa 


aaatttcaat 


1680 


cttaaaattg 


tttttttcca 


ggttacacaa 


aaaaaacttg 


tagtttgtaa 


attttctata 


1740 


caattttggg 


gatctcaaca 


agaacatgaa 


cttcaacttc 


tagtcatatg 


acgacctgag 


1800 


tctgcgcggc 


tgtgaatctc 


tttgctgcag 


taaatgttta 


caagtggtgt 


gtaaattggt 


1860 


actgattcaa 


aagctttaag 


aaatctacac 


atttcgtgaa 


attatttagc 


agacttgata 


1920 


ttaaaaatct 


aggataaaat 


gactatccaa 


agacaaatag 


gactgtttca 


catgttcccc 


1980 


tgattcttgt 


agctcataac 


tcatcagcag 


ttaacttttc 


tacctcatac 


acgctcgcaa 


2040 


tncgtttgga 


attatcagct 


ntaatttttc 


taattctttg gaaattatta 


gcagctcgat 


2100 


caaatggggc 


atggcttctt 


cttctatctg 


caactcatct 


aaactttcca 


tgaagaaaca 


2160 


aagct 












2165 



<210> 2 

<211> 6 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 



<400> 2 

Arg Phe Phe Arg Ala Leu 



f 



1 5 



<210> 3 

<211> 7 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 

<400> 3 

Ala Val Leu Ala Thr Pro Val 
1 5 



<210> 4 

<211> 21 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 
<400> 4 

Trp Leu Gly Trp Asp Lys Arg Met Leu Met Leu Glu Thr Arg Leu Asn 
15 10 15 

Gin Asn Val Val Ser 
20 



<210> 5 

<211> 35 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 
<400> 5 

Ser Thr Gin Leu Ser Met Glu Leu Leu He He Gly Met Thr Trp Arg 
15 10 15 

Arg Phe Gly He Thr Leu Ser Thr Met Ser Phe Val Leu Pro Leu Lys 
20 25 30 

Asn He Arg 
35 



<210> 6 

<211> 56 

<212> PRT 

< 2 1 3 > Unknown 



<220> 



<223> Describes a predicted protein sequence 
<400> 6 

Leu Thr Glu Ala Pro Leu Asn Pro Lys Ala Asn Arg Glu Lys Met Thr 
15 10 15 

Gin lie Met Phe Glu Thr Phe Asn Thr Pro Ala Met Tyr Val Ala lie 
20 25 30 

Gin Ala Val Leu Ser Leu Tyr Ala Ser Gly Arg Thr Thr Gly Gin Tyr 
35 40 45 

lie Thr Thr Phe Phe Leu Tyr Arg 
50 55 



<210> 7 

<211> 191 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 
<400> 7 

Ser Gly Asp Gly Val Ser His Thr Val Pro lie Tyr Glu Gly Tyr Ala 
15 10 15 

Leu Pro His Ala He Leu Arg Leu Asp Leu Ala Gly Arg Asp Leu Thr 
20 25 30 

Asp His Leu Met Lys He Leu Thr Glu Arg Gly Tyr Ser Phe Thr Thr 
35 40 45 

Thr Ala Glu Arg Glu He Val Arg Asp Met Lys Glu Lys Leu Ser Tyr 
50 55 60 

He Ala Leu Asp Phe Glu Gin Glu Leu Glu Thr Ser Lys Thr Ser Ser 
65 70 75 80 

Ser Val Glu Lys Ser Phe Glu Leu Pro Asp Gly Gin Val He Thr He 
85 90 95 

Gly Ala Glu Arg Phe Arg Cys Pro Glu Val Leu Phe Gin Pro Ser Met 
100 105 110 

He Gly Met Glu Asn Pro Gly He His Glu Thr Thr Tyr Asn Ser He 
115 120 125 

Met Lys Cys Asp Val Asp He Arg Lys Asp Leu Tyr Gly Asn He Val 
130 135 140 

Leu Ser Gly Gly Thr Thr Met Phe Asp Gly He Gly Asp Arg Met Ser 
145 150 155 160 



Lys Glu He Thr Ala Leu Ala Pro Ser Ser Met Lys He Lys Val Val 
165 170 175 



Ala Pro Pro Glu Arg Lys Tyr Ser Val Trp He Gly Gly Ser He 
180 185 190 



<210> 8 

<211> 13 

<212> PRT 

< 2 1 3 > Unknown 

<220> 

<223> Describes a predicted protein sequence 
<400> 8 

Val Pro Asn Leu Gin Met Trp He Ala Lys Ala Glu Tyr 
15 10 



<210> 9 

<211> 26 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 
<400> 9 

Asn Leu Asp Arg Gin Ser Ser Thr Gly Ser Ala Ser Asp Gin Lys Ser 
15 10 15 

Pro Ser Lys Thr Arg Ala Val Lys He Leu 
20 25 



<210> 10 

<211> 56 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 
<400> 10 

Asn Ser Ser Ala Val Asn Phe Ser Thr Ser Tyr Thr Leu Ala He Arg 
15 10 15 

Leu Glu Leu Ser Ala Leu He Phe Leu He Ser Leu Glu He He Ser 
20 25 30 

Ser Ser He Lys Trp Gly Met Ala Ser Ser Ser He Cys Asn Ser Ser 
35 40 45 

Lys Leu Ser Met Lys Lys Gin Ser 
50 55 



<210> 11 

<211> 194 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 
<400> 11 

Ser Gly Asp Gly Val Ser His Thr Val Pro He Tyr Glu Gly Tyr Ala 
1 5 10 15 

Leu Pro His Ala lie Leu Arg Leu Asp Leu Ala Gly Arg Asp Leu Thr 
2° 25 30 

Asp His Leu Met Lys He Leu Thr Glu Arg Gly Tyr Ser Phe Thr Thr 
35 40 45 

Thr Ala Glu Arg Glu He Val Arg Asp Met Lys Glu Lys Leu Ser Tyr 
50 55 60 

He Ala Leu Asp Phe Glu Gin Glu Leu Glu Thr Ser Lys Thr Ser Ser 

65 70 -7ir 

75 80 

Ser Val Glu Lys Ser Phe Glu Leu Pro Asp Gly Gin Val He Thr He 
85 90 95 

Gly Ala Glu Arg Phe Arg Cys Pro Glu Val Leu Phe Gin Pro Ser Met 
100 105 110 

He Gly Met Glu Asn Pro Gly He His Glu Thr Thr Tyr Asn Ser He 
115 120 125 

Met Lys Cys Asp Val Asp He Arg Lys Asp Leu Tyr Gly Asn He Val 
130 135 140 

Leu Ser Gly Gly Thr Thr Met Phe Asp Gly He Gly Asp Arg Met Ser 
"-^^ 150 155 

Lys Glu He Thr Ala Leu Ala Pro Ser Ser Met Lys He Lys Val Val 
165 170 175 

Ala Pro Pro Glu Arg Lys Tyr Ser Val Trp He Gly Gly Ser He Leu 

185 190 

Ala Ser 



<210> 12 

<211> 9 

<212> PRT 

<213> Unknown 

<220> 

<223> Describes a predicted protein sequence 



<400> 12 



Gin Met Trp lie Ala Lys Ala Glu Tyr 
1 5 



<210> 13 
<211> 296 
<212> PRT 

<213> Arabidopsis thaliana 
<220> 

<223> Ecotype Columbia, describes actin 
<400> 13 

Met Glu Lys He Trp His His Thr Phe Tyr Asn Glu Leu Arg Val Ala 
15 10 15 

Pro Glu Glu His Pro Val Leu Leu Thr Glu Ala Pro Leu Asn Pro Lys 
20 25 30 

Ala Asn Arg Glu Lys Met Thr Gin He Met Phe Glu Thr Phe Asn Thr 
35 40 45 

Pro Ala Met Tyr Val Ala He Gin Ala Val Leu Ser Leu Ala Ser Gly 
50 55 60 

Arg Thr Thr Gly Gly He Val Leu Asp Ser Gly Asp Gly Val Ser His 
" 70 75 80 

Thr Val Pro He Tyr Glu Gly Tyr Ala Leu Pro His Ala He Leu Arg 
85 90 95 

Leu Asp Leu Ala Gly Arg Asp Leu Thr Asp His Leu Met Lys He Leu 
100 105 110 

Thr Glu Arg Gly Tyr Ser Phe Thr Thr Thr Ala Glu Arg Glu He Val 
115 120 125 

Arg Asp Met Lys Glu Lys Leu Ser Tyr He Ala Leu Asp Phe Glu Gin 
130 135 3^40 

Glu Leu Glu Thr Ser Lys Thr Ser Ser Ser Val Glu Lys Ser Phe Glu 
1^5 150 155 160 

Leu Pro Asp Gly Gin Val He Thr He Gly Ala Glu Arg Phe Arg Cys 
165 170 175 

Pro Glu Val Leu Phe Gin Pro Ser Met He Gly Met Glu Asn Pro Gly 
180 185 190 

He His Glu Thr Thr Tyr Asn Ser He Met Lys Cys Asp Val Asp He 
195 200 205 

Arg Lys Asp Leu Tyr Gly Asn He Val Leu Ser Gly Gly Thr Thr Met 
210 215 220 

Phe Gly Gly He Gly Asp Arg Met Ser Lys Glu He Thr Ala Leu Ala 



Pro Ser Ser Met 



Ser Val Trp He 
260 

Gin Met Gin Met 
275 

Ser He Val His 
290 



230 

Lys He Lys Val 
245 

Gly Gly Ser He 

Trp He Ala Lys 
280 

Arg Lys Cys Phe 
295 



235 

Val Ala Pro Pro 
250 

Leu Ala Ser Leu 
265 

Ala Glu Tyr Asp 



240 

Glu Arg Lys Tyr 
255 

Ser Thr Phe Gin 
270 

Glu Ser Gly Pro 
285 



